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In this work, the low rank approximation concept is extended to the non-equihbrium Green's function (NEGF) 
method to achieve a very efficient approximated algorithm for coherent and incoherent electron transport. 
This new method is applied to inelastic transport in various semiconductor nanodevices. Detailed benchmarks 
with exact NEGF solutions show 1) a very good agreement between approximated and exact NEGF results, 
2) a significant reduction of the required memory, and 3) a large reduction of the computational time (a 
factor of speed up as high as 150 times is observed). A non-recursive solution of the inelastic NEGF transport 
equations of a 1000 nm long resistor on standard hardware illustrates nicely the capability of this new method. 
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I. INTRODUCTION 

Modern semiconductor devices have reached such small 
dimensions that carrier confinement, interference effects 
and tunneling play an equally important role as incoher- 
ent scattering, momentum and energy relaxation doi^— 
The non-equilibrium Green's function (NEGF) method 
is among the most widely employed methods to describe 
carrier dynamics in open quantum systemsj^i^ In fact, 
the NEGF method is applied to a constantly growing va- 
riety of systems ranging from phonon transport,—!^ spin 
transport^ii^ electron dynamics in metalsfii"— organic 
moleculesi^ and fuUerenes,— "— and semiconductor nano- 
structuresp^"— Unfortunately, the basic NEGF equa- 
tions are numerically cumbersome and extremely time 
demanding to solve. Therefore, several different approx- 
imations for particular devices and situations have been 
developed to reduce the numerical costs. The recur- 
sive Green's function method reduces the peak numeri- 
cal burden to a device dependent sub-block matrix of the 
system's hamiltonian, and the computational cost scales 
linearly with the number of blocks but cubically with 
the block sizej ^^'^^ It is widely used for the simulations 
with one transport direction such as FinFETs^Ti^ and 
nanowire structure.— Mode space approaches in similar 
wire structures as well as the newly developed Equivalent 
transport mode method separate the transport direction 
from transverse confinement directions thus reducing the 
block size in each layeri^"— All these methods usually 
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require a clear distinction between the transport direc- 
tion and transverse degrees of freedom. When this dis- 
tinction gets blurred, as in the case of incoherent scat- 
tering, their numerical efficiency drops significantly. A 
very efficient method to solve ballistic NEGF equations 
is the contact block reduction method (CBR)i^"— How- 
ever, this method does not offer self-consistent incoherent 
scattering capability. Niche applications of the NEGF 
method have used sophisticated Wannier and Wannier 
Stark functions to represent the transport problem in the 
presence of many incoherent scattering mechanisms i^i^^ 
This specific basis representation, however, is custom 
made for quantum cascade lasers and super lattices. 

In this work, the low rank approximation (LRA) 
method is adapted to the NEGF equations of electrons 
in simiconductors in the presence of inelastic scattering 
on phononSf^i This method is an extension of the "ba- 
sis reduction method" of Greek et ali^i^ The concept 
of low rank approximation is inherited from data mod- 
eling in control theory^, machine learning^, signal pro- 
cessing^, bioinforinatics^ for microarray data analysis 
etc. In the framework of NEGF, the transport problem 
is transformed from the original basis representation, i.e. 
in this work real space in effective mass hamiltonian pre- 
sentation, to a more appropriate basis of quasi-particle 
states that are close to the quasi-particles of the actual 
device. In this representation, the number of required 
basis functions is much less than in the original space, 
which allows to reduce the numerical costs significantly. 
In so far, this method is closely related to the beforemen- 
tioned mode space approach. However, the LRA method 
is a generalization to that, since it does not have any pre- 
requisites to the device geometry. In addition, the LRA 
implementation of this work uses a third basis represen- 



tation to enable real space defined inelastic scattering 
mechanisms. 

In Sec. nil the method of this work is introduced, its 
numerical complexity is analyzed and differences of this 
method with existing approximations are discussed. In 
Sec. mil transport in homogeneous resistor and a reso- 
nant tunneling diode is calculated. The comparisons of 
exact NEGF calculations with the LRA approximated re- 
sults show the accuracy of the presented method. Limita- 
tions that this method (as every approximation approach 
does) faces are also discussed in this section. To exem- 
plify the computational strength of the LRA method, 
electronic transport in a 1000 nm resistor is calculated in 
the end of this section. Energy resolved density spectrum 
illustrates the transition from ballistic to drift diffusion 
transport in this resistor. The paper concludes with a 
summary in Sec. IIVI 



II. METHOD 

A. Low Rank Approximation Method 

NEGF calculations are time consuming since they in- 
volve the inversion and multiplication of matrices with 
the rank N of the system's hamiltonian. The fundamen- 
tal concept of the LRA method is to reduce the compu- 
tational cost by transforming the NEGF equations into a 
space of lower rank n and solving the equations therein. 
It is expected that the closer the basis functions of the 
lower rank space are to the physically relevant quasi- 
particlcs of the device, the better the LRA approximation 
is and the smaller the ratio n/N can be chosen. The so- 
lution of the NEGF equations and all observables can 
be transformed back into the original space after self- 
consistent calculation is achieved. In this way, the matri- 
ces that represent the Green's functions and self-energies 
still have the lower rank n, but the dimensionality N 
which is required to maintain compatibility with other 
equations (such as the Poisson equation) that might still 
be given in the original space. 

This method is exemplified on the stationary vertical 
transport in laterally homogeneous quantum well hetero- 
structures that are in contact with two charge reservoirs. 
The electron structure is represented in terms of a single 
band effective mass hamiltonian Hq that is represented 
in a basis of N position eigenfunctions 
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where fcii is the in-plane electron momentum and V (z) 
represents a position dependent potential. In the NEGF 
formalism, stationary transport is determined by four 



coupled partial differential equations 
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Here, the electronic retarded and lesser Green's func- 
tions are given by G^, G^, respectively.—"— D is the 
sum of all environmental Green's functions that incorpo- 
rate e.g. phonons, and E denotes the self-energies. The 
devices are in contact with two charge reservoirs, rep- 
resented with contact self-energies4i^ If not explicitly 
stated otherwise, all calculations in this work include in- 
elastic scattering by longitudinal acoustic phonons given 
by the scattering self-energies^2ii^di 
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with the energy-averaged Green's functions 
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The acoustic deformation potential and the material den- 
sity is denoted by Dae and p, respectively. The acous- 
tic phonon frequency is coq and Vs is the sound veloc- 
ity4^>i^iM The Debye frequency oJOa^ limits the width of 
the average. 

As the first step of this method, the n eigenfunctions of 
the free particle Hamiltonian Hq with Neumann bound- 
ary conditions are solved that have smallest eigen energy 
E,, 
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Hereby, n is chosen such that the energy E^ is about sev- 
eral UbT above the highest chemical potential of all leads 
(with the Boltzmann constant /cb and the temperature 
T). Thereby, all quasi-particles with energies below _E„ 
are appropriately considered in the calculation. This is 
essential to capture all occupied electronic states and to 
predict the density accurately. It is worth to mention 
that if only the transmission around a given energy Eq is 
required, it is sufficient to consider eigenstates of a few 
ksT around Eq. 

In the second step, the n orthonormal eigenstates 4n 
are set into the n columns of a A'^ x n dimensional matrix 
S. This matrix S is unitary in the n dimensional space 
fi spanned by the wavcfunctions 0^, but note that it is 
not unitary in the N dimensional real space of step one 
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B. Comparison with existing efficient NEGF algorithms 



To define the locafity/non-locality of scattering self- 
energies, the position operator of the real space dis- 
cretization X is transformed into the reduced rank basis 
in the third step 



Xs = S^XS. 



(8) 



The position operator X is a diagonal matrix, whereas 
the reduced rank matrix Xs is a dense matrix. Therefore, 
the operator Xg is diagonalized to find the reduced rank 
position eigenfunction basis {4'i} 



Xsi^i =Xiipi, i == 1,2,.. 
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These orthonormal basis functions {'>pi} define the 
columns of a squared, unitary n x n transformation ma- 
trix P. In the basis {^i} the NEGF equations Eqs. ^ 
read 
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Since every basis function ipi is associated with a position 
Xi the equations above are discretized in a reduced rank 
real space representation. Position dependent scattering 
self-energies such as the acoustic phonon scattering self- 
energy in Eq. ^ are then self-consistently solved with 
the non-equilibrium Green's functions in the numerically 
efficient reduced rank real space. The introduction of the 
reduced real space helps to avoid the back-transformation 
of the Green's functions into the original real space in 
the self-consistent calculation when position dependent 
scattering self-energy is calculated. 

Once the NEGF equations Eqs. ([TU]) are converged, 
the diagonal and the first off-diagonal elements of G^ 
are transformed back into the original rank N real space 
representation. Observables such as the density or the 
current density can then be evaluated in the original, high 
resolution real space. However, the rank of the Green's 
functions in the N dimensional system equals the dimen- 
sion of the space they are solved in, i.e. the rank equals 
n. The smaller n is compared to N, the more unreli- 
able the N dimensional spatial information is, i.e. the 
stronger deviations of the LRA results from the exact re- 
sults are. It will be shown as one of the example results 
in the next section, that the LRA approximated current 
density oscillates in the original space, although the phys- 
ical current of exact calculations is conserved. To predict 
current voltage characteristics in the LRA method, this 
inhomogenous current density is averaged over the device 
excluding areas within AN/n from the leads (where A is 
the average mesh point distance in the original real space 
representation) . 



It is important to highlight some differences of this 
method with other, well established efficient NEGF al- 
gorithms such as the CBR method^^i"— , mode space ap- 
proachea2Si21 and recursive Green's function methodi ^^i^^ 

In the CBR method, the NEGF equations are first 
transformed into an efficient representation to utilize the 
fact that ballistic calculations require only some sections 
of the retarded Green's function G^ to be solved. A 
rectangular transformation that reduces the rank of the 
NEGF equations is applied only after that first transfor- 
mation. Although the CBR method is very efficient, it is 
fundamentally limited to ballistic calculations. 

The mode space approach assumes a separation Ansatz 
for the wave functions of propagating quasi-particles. 
Typically, the Ansatz requires confined modes or plane 
waves perpendicular to the transport direction. The 
mode space approach allows a significant rank reduction 
of the NEGF equations. The computational burden is 
even further reduced if these modes are well separated 
in energy and the particle propagation does not couple 
different modes. However, if the device contains inhomo- 
geneities (impurities, non conformal confinement, etc.) 
the number of the modes is no longer a conserved quan- 
tum number. Then, the modes are coupled and the rank 
of the mode space has to be large to predict transport 
without loss of accuracy. 

The recursive Green's function method allows to limit 
the calculation of the retarded Green's function to se- 
lected parameter intervals of the propagation space (i.e. 
sub- matrices of G^, when G^ is represented in matrix 
form). This allows limiting ballistic NEGF calculations 
on the required elements of G^ only, which results in 
much faster transport solutions than the case when the 
complete G^ is solvedi^ NEGF that includes incoherent 
scattering, however, requires the full G^ which deterio- 
rates the advantages of this recursive method. 

In contrast to these three methods, the LRA method 
allows the inclusion of any incoherent scattering as well 
as arbitrary device geometries. The "modes/sub-block 
matrix" of the LRA method are device dependent wave 
functions that automatically include non-conformal con- 
finement - if such confinement appears. 



C. Numerical Complexity and Memory Usage Analysis 

Coherent quantum transport calculations for realisti- 
cally extended devices have been shown to efficiently con- 
sume the computational power of over 220,000 process- 
ing cores4^i^ Incoherent NEGF based calculations re- 
quire about lOOx more computational power, are limited 
to generally unrealistical small structures, and can only 
scale to about 100,000 cores42ii^ Involving incoherent 
scattering in realistically extended devices requires dra- 
matically large computational resources. 



The numerical complexity and memory usage of self- 
consistent NEGF calculations reduce when the LRA 
method is applied. To qualify that, this section compares 
the number of floating point operations and the mem- 
ory usage of a " conventional" NEGF calculation with an 
approximated ballistic NEGF solution that employs the 
LRA method. In the following, the transport problem is 
assumed to be originally discretized with N orthogonal 
basis functions. Within the LRA method, the rank of the 
NEGF equations is reduced down to n. The energy and 
other conserved quantum numbers of the NEGF equa- 
tions are resolved with a mesh of Ne points. To get a 
conserved current density within the self-consistent Born 
approximation, Ni iterations of the Green's functions and 
self-energies are required. The integral in Eq. ([3|) is solved 
with Nph energy points. 

Exact NEGF: The exact solution of one retarded 
Green's function involves the inversion of a A^ 
dimensional matrix which requires O [N^) float- 
ing point operations. The solution of each lesser 
Green's function involves two matrix-matrix prod- 
ucts with a numerical load of O {N^) floating point 
operations. The solution of the local scattering 
self-energy of Eq. ([3]) is O {Np^ x N) for each en- 
ergy point in each iteration. In total, solving 
the NEGF equations exactly requires Ne x Ni x 
(O (N^) + O {Nph X N)) floating point operations, 
while the memory needed to store the matrix rep- 
resentation of Green's functions and self-energies is 
Ne X O {N'^) floating point numbers. 

Approximate NEGF: The LRA method can be de- 
composed into three steps: 1) the transformation of 
the NEGF equations into the reduced space, 2) the 
solution of the NEGF equations within the reduced 
space and 3) the back-transformation of some rel- 
evant results into the original space. Step 1) re- 
quires first to get the eigen states that construct the 
transformation matrix, i.e. O (N'^n) floating point 
operations. The memory used to store the trans- 
formation matrix is O (Nn) floating point numbers. 
The transformation of the device's hamiltonian into 
the reduced space requires then O (^N'^nj floating 
point operations. The contact self-energies have to 
be transformed for every energy point in every iter- 
ation. Since the contact self-energy is zero except 
for the mesh points adjacent to the leads in the 
original space, each transformation requires only 
Ne X Ni X O (n^) floating point operations and 
Ne X O (n^) floating point numbers to be stored. 
Solving the NEGF equations in the reduced space 
in step 2) requires Ne x NiX O (n^) floating point 
operations and memory usage of Ne x O (ri^j float- 
ing point numbers. The calculation of the acous- 
tic phonon self-energy costs O {Nph x n) operations 
in the reduce real space for each energy point in 
each iteration. To calculate the energy resolved 
densities and current densities^i^i^ in the orig- 



inal space, the step 3) requires to back-transform 
the diagonal and the two flrst off-diagonals of G^ 
of the original space. This transformation requires 
Ne X O {Nn^^ floating point operations and mem- 
ory usage of Ne x O [N) floating point numbers. 

In a typical effective mass NEGF calculation, the simu- 
lation setup reads A^ ^ 100, n = 10, A^^ = 1000, A^^ = 10 
and Nph — 10. According to the analysis above, the 
numerical complexity of standard NEGF calculation in 
the typical effective mass situation is O (10"'^°) floating 
point operations, and the memory usage is O (lO^) float- 
ing point numbers; the numerical complexity of LRA ap- 
proximated NEGF calculation is O (lO^) floating point 
operations, and the memory usage is O (lO^) floating 
point numbers. This observation demonstrates clearly 
that LRA method can reduce both numerical cost and 
memory usage signiflcantly. 

The comparison of the amount of floating point op- 
erations and memory usage between the exact and the 
approximated LRA approach illustrate that the LRA 
method offers approximated solutions of the NEGF equa- 
tions much faster and with a much smaller memory load 
than the exact solutions. In fact, one can easily flnd 
NEGF equations of state of the art devices that are only 
solvable when the LRA method is applied. To illustrate 
this, section IIIICI shows LRA approximated NEGF re- 
sults of a 1000 nm homogeneous resistor with inelastic 
acoustic phonon scattering calculated on single CPU. 



III. RESULTS AND DISCUSSIONS 

All devices in this section are laterally homogeneous 
layers grown in the z-direction. Stationary transport 
along the z direction is calculated for conduction band 
electrons in the effective mass approximation. In the 
original real space discretization (i.e. before LRA trans- 
formations are applied), the Green's functions and self- 
energies are functions of two propagation coordinates z 
and z', the absolute in-plane momentum fcy and the elec- 
tron energy E. All devices of a given length L are consid- 
ered to be in contact with two charge reservoirs at z = 
and z = L, respectively. 



A. Homogeneous structure 

Conduction band electrons of a 50 nm thick, homo- 
geneous layer of GaAs with an effective mass of m* = 
0.067 nifr^ are considered in this section. The NEGF 
equations are discretized with a 0.5 nm mesh spacing. 
The Fermi energies in both leads are assumed to agree 
with the respective conduction band edge. The temper- 
ature is set to 300 K. The conduction band in the device 
is set to be constant in the first and last 5 nm of the 
device and to drop linearly by the amount of the applied 
bias voltage in the central 40 nm of the device. 



3.0 



< 

m 

O 



<A 

C 

a> 
Q 

c 
o 

O 



■ / \ 



2.5 
2.0 
1.5 H 



1.0 '-; 
l' 

1/ 

0.5 i 

f 

0.0 



I / 



. r- — ^-^y 



I .' 
I / 
1/ 



V = 0.2 V 



■ Exact Results 
20% Matrix Rank 
■10% Matrix Rank 



"u 
i' 
i! 



10 



20 



30 



40 



50 



Position in z direction (nm) 



Figure 1. Spatially resolved current density in the homoge- 
neous structure described in the main text Sec. IIII Al with a 
linear potential drop of 0.2 eV. The lines show result cal- 
culated with the NEGF method solved exactly (solid) and 
with the NEGF method solved approximately with a reduc- 
tion of the matrix rank down to 20% (dashed) and 10% (dash- 
dotted). 
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Figure 2. Calculated electron density of the homogeneous de- 
vice of Fig. \T\ in equilibrium (a) and when a linear potential 
drop of 0.2 eV is assumed (b). Results of an exact NEGF 
calculation (solid) are compared with approximated NEGF 
calculations where the NEGF equations' matrix rank is re- 
duced down to 20% (dased) and 10% (dash-dotted). 



Figure [T] shows the spatially resolved current density 
that results from an exact NEGF calculation as well as 
current densities of LRA calculations when the matrix 
rank is reduced to 20% and 10% of the original space. 
The exact calculation yields a spatially constant current 
in the device, since inelastic phonon scattering is included 
through a converged self-consistent Born approximation. 
At the device boundaries the matrix elements of the exact 
contact self-energy is non-zero as is common in the NEGF 
method. This non-vanishing self-energy allows electrons 
to enter and leave the device, thus this contact self-energy 
violates current conservation at the device boundaries. 
In the LRA method, the contact self-energies are trans- 
formed into dense matrices. Their largest elements are 
still located close to the device boundaries, which causes 
the largest current fluctuations there. The larger the ma- 
trix rank reduction is, the larger contact self-energy ma- 
trix elements within the device are. Consequently, the 
larger the rank reduction is, the maximum amplitude 
of current density fluctuations within the device is the 
higher. The smaller the rank of the reduced real space 
is, the more dense the contact self-energies are. This al- 
lows electrons to leave/enter the device at/to any device 
point in the reduced rank space. The non-constant cur- 
rent densities in Fig. [T]in the original real space indicate 
this kind of violation of particle conservation. Similar to 
the current fluctuations the density deviates from the ex- 
act solution stronger, if the rank of the NEGF equations 
is reduced more: Figure [2] shows the electron density in 
the homogenous layer of GaAs in equilibrium and at fi- 
nite applied bias voltage. In both cases, the deviations 
are strongest close to the leads. 



Both of the above figures indicate that the LRA 
method can reproduce exact NEGF results in the device 
center up to close to the leads. This motivated the device 
average of the current density described in Sec. IllAI All 
remaining current densities in this paper are such device 
averaged results. 

Figure [3] shows 1-V characteristics of the 50 nm thick 
homogeneous GaAs layer that have been calculated in 
the exact NEGF method, as well as in the approximate 
LRA method with various rank reduction levels. In addi- 
tion. Fig. |3] also shows an exact NEGF calculation of the 
same device with a ten times coarser grid mesh. Similar 
to previous figures, the deviation of the LRA approxi- 
mated from the exact NEGF results is larger, the larger 
the rank reduction is. Nevertheless, a reduction of rank 
to 10% is still able to well reproduce the 1-V characteris- 
tics, since only a small fraction of electronic states in the 
low energy range contributes to the current density. In 
contrast, results of exact NEGF calculations with a ten 
times coarser grid deviate significantly from the full rank 
result: Such a coarser real space mesh yields a different 
effective electron dispersio n^^'^^ that deviates from the 
parabolic dispersion within the relevant energies. 

It is worth to mention that the boundary conditions for 
the electronic wave functions of Eq. ([5|) are relevant for 
the efficiency of the LRA method. In agreement to simi- 
lar findings of Mamaluy et al.;^"— basis functions with 
Dirichlet boundary conditions turned out to be inferior 
to Neumann conditions. 

The LRA method and standard NEGF calculations 
were implemented in Matlab with 8 cores parallelization. 
For this concrete homogeneous structure calculation, the 
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Figure 3. Comparison of I-V characteristic of the structure of 
Fig. [T] determined in exact NEGF calculations (solid) and in 
approximated NEGF solutions where the NEGF matrix rank 
is reduced to 20% (dashed) and 10% (dash-dotted). Also 
shown are the results for an exact NEGF calculation of the 
same device when it is discretized with a 10 times coarser real 
space mesh (dotted). 
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Figure 4. Conduction band profile (solid line) and contour 
plot of the energy resolved electron density of the RTD struc- 
ture described in the main text Sec. lIIIBl The energy resolved 
density is calculated within the NEGF method exactly (a) and 
approximately by a reduction of the NEGF equations' matrix 
rank to 10% (b). The filling of bound state in triangular well 
by inelastic acoustic phonon scattering is well captured in the 
approximate method. 



measured computational time for matrix rank reductions 
down to 10% and 20% is reduced by factories of 35X and 
87X respectively compared to the full solutions, as listed 
in Table HI In our non-optimized LRA Matlab implemen- 
tation, most of the time is spent to transform Green's 
functions between different basis representations. If fur- 
ther optimization on matrix transformation is performed 
(as discussed in Sec lIICp . the factor of speed up can be 
even larger. 



B. Resonant tunneling diode 

This section explores the compatibility of the LRA 
method in quantum confined systems. The NEGF equa- 
tions are solved in a 80 nm GaAs/Alo.sGao.yAs reso- 
nant tunneling diode (RTD) structure at 100 K. The 
RTD consists of two 3 nm wide Alo.3Gao.7As barriers 
and a 5 nm quantum well in the center. In addition, 
a 40 nm flat band region is located at emitter region. 
The effective mass for GaAs is 0.067 mo and 0.0919 mo 
for Alo.3Gao.7As42ii^i^i^ The band offset between these 
two materials is 230 meV4Si^'^i^ In the original real 
space representation, the device is discretized with a grid 
spacing of 0.5 nm. The Fermi energies in the leads arc 
set 0.005 meV beneath the respective conduction band 
edges. The potential profile is assumed to be constant in 
the left most 40 nm and to drop linearly in the remaining 
RTD region. This is illustrated by the solid line in Fig. |4] 
(a) and (b) which show the same assumed conduction 
band profile of the RTD in an exact NEGF calculation 
(a) and a 10% LRA approximated NEGF calculation (b). 



Figures 131(a) and (b) also show contour graphs of the en- 
ergy and spatially resolved electron density of the RTD at 
vanishing in-plane momentum (A:|| = 0) 32,33,40,43,52 -^^j-^q^ 
a voltage of 0.1375 V is applied. Both results agree very 
well: Fig. 2] (b) deviates from (a) only at the energy of 
about 0.07 eV and positions z G [65,80]. Even the con- 
fined state in the triangular quantum well locating at 
left of the first RTD barrier (at energy of about 0.05 eV 
and position 50 nm) is well reproduced in the LRA cal- 
culation. This is remarkable, since electrons can enter 
this state effectively only via inelastic scattering. There- 
fore, inelastic scattering and tunneling are well repro- 
duced with the LRA method. That can also be seen in 
Fig. [5l which shows the I-V characteristics of this RTD 
structure calculated in the exact NEGF method, as well 
as in the LRA method with 10% and 3.1% of the origi- 
nal matrix rank. Neither the current amplitude nor the 
resonance value get significantly altered when the NEGF 
equations are solved with only 10% of the original matrix 
rank. If the matrix rank is reduced too much, electronic 
states that are relevant for the transport are neglected. 
Consequently, the current density starts to deviate then. 
This is illustrated in Fig. [5] with the I-V characteristic re- 
sults of a LRA approximated NEGF calculation of only 
3.1% of the original matrix rank. As stated in Sec. Ill Al 
the ratio of the matrix rank reduction can be estimated 
from the energy interval in which the states are occupied. 

For this RTD example, the measured computational 
time for matrix rank reductions down to 10% and 20% is 
reduced by factories of 39X and 150X respectively com- 
pared to the full solutions, as listed in Table HI 





10% Matrix Rank 


20% Matrix Rank 


Exact Solution 


50 nm resistor 


42s/87X 


104s/35X 


3658s/lX 


RTD structure 


287s/ 150X 


1107s/39X 


43058s/ IX 


1000 nm resistor 


too aggressive reduction 


459hr 


memory exceed 



Table I. The measured time consumptions for tlie tliree examples in Sec. lIIII for matrix rank reductions down to 10% and 20% 
as well as exact NEGF solutions are listed. The LRA method and standard NEGF calculations were implemented in Matlab 
with 8 cores parallelization. 
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Figure 5. I-V characteristic of the RTD structure of Fig. |4] 
calculated exactly (solid line) and approximately with 10% 
(dashed) and 3.1% (dash-dotted) of the original NEGF equa- 
tions' matrix rank. 



C. lOOOnm long GaAs resistor 



2.5 



2.0 



E 1.5 



U) 

I 1-0 

c 
a> 

3 0.5 
O 



0.0 



Ballistic ^ 




- - D = 27 eV ^ L- - 


— 


" D = 60eV ^'^^^ ' ' ' 

■ ■ ■ D = 135eV ^/^ " " 






- 


// ^ 


■ 




- 


/<■■'. 





0.00 0.03 0.06 0.09 

Applied Voltage (V) 



0.12 



0.15 



Figure 6. The I-V characteristic of the 1000 nm homogenous 
structure described in the main text Sec. IIIICl when different 
values for the deformation potential are used: 27 eV (dashed), 
60 eV (dash-dotted) and 135 eV (dotted), and the ballistic 
results are shown as a solid curve. All results are determined 
from approximated NEGF equations with a matrix rank of 
20% of the original rank. 



The new LRA method is not only more efficient than 
the standard NEGF approach, it also opens up a space of 
device configurations that previously could not be tack- 
led. This section considers electronic transport in the 
presence of inelastic phonon scattering in a 1000 nm long 
homogeneous GaAs layer. In the range of 100 nm within 
the source and the drain contact/device interface, the 
conduction band is assumed to be constant. In the re- 
maining device, the conduction band drops linearly ac- 
cording to the applied bias voltage. The temperature of 
the phonon bath and the electrons in the leads is 300 K 
and the Fermi levels of the leads are set 0.1 eV above the 
respective conduction band edge. The system is origi- 
nally discretized with a mesh spacing of 1 nm. The re- 
sulting NEGF equations are approximated with a 20% 
matrix rank. This reduces the numerical complexity of 
the NEGF equations such that they have been solved on 
a single CPU without recursive approaches. The nature 
of the transport is tuned from purely ballistic to almost 
drift diffusion like by increasing the deformation poten- 
tial D of the phonon scattering self-energy of Eq. ([3]). 
Hereby, three different scattering potentials have been 



considered: 27 eV, 60 eV and 135 eV, which corresponds 
to a scattering rate of 1 x 10^^ s~^, 5 x 10"^^ %~^ and 
2.5 X 10^"^ s~^ for electrons with kinetic energy of 0.3 eV. 
The impact of the scattering is illustrated in Fig. |6] as 
it shows the calculated I-V characteristics of the device 
with various scattering strengths. The I-V characteristic 
is almost ohmic in the case of a deformation potential of 
135 eV. The nature of transport at this large deforma- 
tion potential can be understood from Fig. [T] It shows 
the energy and spatially resolved electron density for the 
1000 nm long resistor in the case of 0.1 V applied bias 
voltage. Electrons that originate from the source contact 
propagate about 400 nm in the device before they start to 
significantly dissipate energy. Then, however, these elec- 
trons follow the potential drop of the device and thereby 
start to maintain a local equilibrium distribution. In this 
way, the electrons experience a transition from effectively 
ballistic transport into the drift diffusion of the rightmost 
500 nm of the device. 
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Figure 7. Energy resolved electron density of the 1000 nm 
long structure of Fig. [6] with a deformation potential of 135 eV 
and a potential drop of 0.1 V. 
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